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Description 

This invention relates to a method for examining single or multiple nucleotide polymorphisms (or mutations) in a 
nucleotide sequence in particular, but not exclusively, a nucleotide sequence contained within a sample of mammalian 

5 genomic DNA sequence. 

Variations between individuals in the DNA sequence of genes coding for functional proteins account for many so- 
called "genetic diseases*. For example, a single base pair substitution in the beta-globin gene leads to the genetic 
disease known as beta-thalassaemia. Similarly, these variations, particularly in the genes coding for functional Human 
Leucocyte antigens (HLA), explain many of the rejection problems encountered in human allogeneic transplant oper- 

10 ations. Considerable effort is required to "HLA match" the donor to the recipient (patient). 

Recently, a technique known as PCR (polymerase chain reaction) fingerprinting has been developed for human 
HLA-DR/Dw allotype matching (Bidwell JL and Hui KM 1 990). As discussed in this paper, this application of the polymer- 
ase chain reaction technique permits the rapid matching between individuals of HLA-DR-Dw specifies. The technique 
exploits the formation of mismatched DNA hybrids (heteroduplexes) which are formed at the end of each PCR cycle 

is between coamplified products of the second exons of expressed DRB genes (DRB1 , DRB3, DRB4, and DRB5) and 
DRB pseudogenes (DRB2, DRB6, DRB7) (Also see Clay TM, Bidwell JL, Howard MR and Bradley BA 1991 and Wood 
NAP, Clay TM and Bidwell JL 1 991 ). During the primer-annealing stage of each cycle of the PCR, a proportion of coding 
strands of each DRB locus may hybridise to noncoding strands of a different DRB locus and vice-versa. Resulting 
molecules contain double stranded regions where the nucleotide sequence is complementary, but regions of nucleotide 

20 sequence mismatch lorm single stranded loops along the length of the molecule. The number, size and position of the 
single stranded loops vary for each DR haplotype and each combination of haplotypes, in both DR-Dw homozygous 
and in heterozygous individuals. Because the molecular conformations of heteroduplexes vary, they may be separated 
from each other, and from the corresponding homoduptexes, by, for example, gel retardation analysis in nondenaturing 
polyacrylamide gels. In practice, haplotype-specifiC'gel banding patterns (termed PCR fingerprints) are observed. 

25 Matching patterns of PCR fingerprints between two individuals indicate DR-Dw identity, whereas mismatched patterns 
indicate DR-Dw disparity: visual examination of ethidium bromide-stained polyacrylamide minigels thus constitutes a 
very simple test for DR-Dw compatibility. 

PCR fingerprints observed in DR-Dw heterozygous individuals may be exactly reproduced by mixing DNAs from 
two corresponding DR-Dw homozygous typing cells (HTCs) prior to PCR (Bidwell and Hui 1990). However, in DR-Dw 

30 heterozygotes, additional bands may be observed which are not present in either of the patterns from the HTCs. These 
additional bands arise from heteroduplex formation in trans, that is, between PCR products from the two different 
haplotypes. This phenomenon is exploited in the DNA crossmatch test (Wood et al 1991, Clay et al 1991). Here, 
genomic DNAs from two different individuals are coamplified in the PCR. The resulting PCR fingerprint is compared 
with those obtained by separate PCRs for each individual. Matching PCR fingerprints from all three products indicates 

35 DR-Dw identity, whereas mismatch is indicated by different patterns and/or the appearance of new heteroduplexes 
formed in trans. This DNA crossmatching test may be successfully exploited to perform limited DR-Dw "typing": for 
example, DNA crossmatches with DR7-DQ9 and DR9-DQ9 HTCs may be used to discriminate between these alleles 
where conventional DNA-RFLP typing fails. 

Theoretically, the number of conformationally distinct heteroduplexes generated from a DR-Dw homozygous or 

40 heterozygous individual with a total of n different alleles at coamplifiabie loci, is n 2 - n (Bidwell and Hui, 1990). In 
practice, particularly in DR-Dw heterozygotes, not all of the predicted conformationally distinct heteroduplexes are 
observed. A likely reason for this incomplete resolution is that a proportion of heteroduplexes comigrate and are thus 
superimposed in gel retardation analysis. In addition, some heteroduplexes may form less avidly or be less stable than 
others. In general, however, there are more than sufficient discriminatory heteroduplexes to permit matching, but in a 

45 few cases (for example, discrimination between certain DR4 alleles in DR4-homozygous individuals), PCR "spiking" 
may be necessary. PCR spiking is the inclusion or "spiking" of PCRs with DNA from a DR-Dw disparate HTC, for 
example a DR8 HTC. Spiking PCRs in this manner with third party DNA generates new heteroduplexes which enhance 
discrimination between otherwise similar PCR fingerprints (Clay et al 1991 and Bidwell JL 1992). DNA from a ho- 
mozygous DR8 cell line has been successfully used as a "spiking" reagent since it generates a defined but limited 

50 number of new heteroduplexes. 

Although PCR fingerprinting works well for HLA-DR/Dw matching, extension of the principle to matching H LA-DP 
and DQ (other loci in the human HLA region coding for alloantigenic functional proteins) is not directly achievable 
because there are insufficient amplifiable loci. Moreover, the PCR fingerprinting technique is unsuitable as a general- 
ised method for examining single or low numbers of dispersed nucleotide polymorphisms, such as those which result 

55 in genetic diseases. 

According to a first aspect of the present invention, there is provided a method for examining polymorphisms in a 
nucleotide sequence of a first sample of nucleic acid comprising the steps of: 
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(a) forming a population of nucleic acid fragments bearing the said nucleotide sequence; 

(b) combining the population of nucleic acid fragments with a population of a synthetic nucleotide construct which 
construct is capable of forming duplexes with the nucleic acid fragments, the sequence of the construct being such 
that duplexes of different molecular conformation are formed between the construct and the nucleic acid fragments 
dependent upon the presence of a polymorphism at a known variable nucleotide or sequence of nucleotides within 
the sequence of the sample under examination; 

(c) permitting duplex formation within the combined populatidns; and 

(d) separating the duplexes formed in step (c). 

The synthetic construct contains one or more deliberate nucleotide substitution(s) and/or deletion(s) and/or inser- 
tions) which is/are (i) opposite to a known variable nucleotide or sequence of nucleotides within the sequence under 
examination and/or (ii) contiguous with a nucleotide which is opposite to a known variable nucleotide or sequence of 
nucleotides within the sequence under examination. 

By the term "opposite 0 , we mean that, in the resultant duplex between the nucleic acid fragment strand and the 
synthetic construct strand, the deliberate nucleotide substitution(s) and/or deletion(s) and/or insertion(s) is/are in the 
positton(s) in the synthetic construct strand corresponding to the variable nucleotide(s) under examination. The "con- 
tiguous 0 portion of the synthetic construct strand may be a single nucleotide in contact with the nucleotide opposite 
the variable nucleotide(s) or may be a nucleotide sequence, normally no more than 20 bases in length, in which one 
of the end nucleotides is in contact with the nucleotide opposite the variable nucleotide(s). 

For the avoidance of doubt, it should be understood that, in the present invention, the nucleotide substitution(s) 
and/or deletion(s) and/or insertion(s) contained in the sequence of the construct are made either (i) relative to the wild 
type of the nucleotide sequence under examination so that, in the duplexes formed between the nucleic acid fragments 
and the construct, there is created a deliberate or controlled mismatch when the nucleotide sequence contains a mu- 
tation from the wild type or (ii) relative to a mutant of the nucleotide sequence under examination so that, in the duplexes 
formed between the nucleic acid fragments and the construct, there is created a deliberate or controlled mismatch 
when the nucleotide sequence is either wild type or another (different) mutant. 

Preferably, the nucleotide sequence of the synthetic construct at least contains deliberate nucleotide substitution 
(s) and/or deletion(s) and/or insertion(s) contiguous with the nucleotide which is opposite the known variable nucleotide 
or sequence of nucleotides. It may also contain deliberate nucleotide substitution(s) and/or deletion(s) and/or insertion 
3 o (s) opposite the known variable nucleotide or sequence of nucleotides. 

Preferably, the synthetic construct includes nucleotide substitution(s) and deletion(s) both opposite and contiguous 
with the mutation. In practice, it has been found that a sequence of between one and ten substitutions/deletions adjacent 
the position of the polymorphism in the nucleotide sequence under investigation is satisfactory for the purposes of this 
invention. 

In order simultaneously to identify more than one polymorphism in a given nucleotide sequence in a single or 
reduced number of expe riments, the sequence of the construct is such that duplexes of different molecular conformation 
are formed between the construct and the nucleic acid fragments dependent upon the presence or absence of poly- 
morphisms at two or more known variable nucleotides and/or sequences of nucleotides within the sequence of the 
sample under examination; it is therefore preferred that the synthetic construct includes deliberate nucleotide substi- 
tution^) and/or deletion(s) and/or insertion(s) corresponding to at least two known polymorphisms in the nucleotide 
sequence, and preferably corresponding to all of the known polymorphisms in the nucleotide sequence. 

The extent of deliberate nucleotide substitution(s) and/or deletion(s) and/or insertion(s) should not be so great that 
duplex formation between the nucleic acid fragments and the construct is not possible. 

The construct may also include a deliberate deletion or a series of deletions, at a position away from polymorphic 
sites in the nucleotide sequence under investigation (this is in addition to any deletion adjacent to a mutation site) - 
this has the effect of permitting better electrophoretic separation of heteroduplexes from homoduplexes. 

The results of the separation conducted in step (d) of the method of this invention may be compared with the 
results of one or more similar separations conducted on duplexes formed using steps (b) to (d) above with the same 
synthetic nucleotide construct and a population of nucleic acid fragments bearing a nucleotide sequence corresponding 
to the said nucleotide sequence under examination, but taken from one or more different samples of DNA or RNA 
This enables polymorphisms in any of the nucleic acid samples to be detected and compared. 

Typically, the nucleic acid under examination will be DNA, normally mammalian (such as human) genomic DNA 
In its broadest aspect, however, the nucleic acid may be DNA or RNA from any source, genomic or otherwise, for 
example, a virus, bacterium, plant or animal source. 

The synthetic nucleotide construct will normally be a synthetic DNA, because this will form the most stable duplexes 
with DNA fragments containing the nucleotide sequence under investigation. 

Where the nucleic acid under examination is mammalian genomic DNA, a sample of the DNA is obtained from an 
individual or other object whose genotype for a specific characteristic it is wished to study. (The term "individual" is 
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intended to include a foetus.) DNA can be extracted from all nucleated cells. Typically, the DNA is obtained from 
peripheral blood cells for convenience. Foetal DNA can be obtained from placental cells or amniotic fluid. Other sources 
of DNA include hair follicles, mummified bodies, etc. 

The DNA may be isolated by any appropriate method, for example by the rapid salting out method described by 
5 Miller et al (Miller, S., Dykes, D. and Polesky, H. (1988) "A simple salting out procedure for extracting DNA from human 
nucleated cells"; Nucl. Acids Res. 16:1215). Alternatively, the DNA may be isolated as cDNA from mRNA by reverse 
transcription. 

The nucleotide sequence contains the polymorphism under examination; thus, the sequence will include at least 
that part of the relevant gene containing the polymorphism. This will normally be a part of the gene ultimately coding 

10 for a region of the functional protein (i.e. exon), but may alternatively or additionally be an intervening part of the gene 
which is not responsible for coding the functional protein (e.g. an intron) but which nevertheless affects transcription 
and translation or is known to be in genetic linkage with specific polymorphisms in coding sequences of the gene. 
Since two or more loci coding for different forms of the same protein - or representing pseudogenes - may be present 
(as for the DRB genes of the HLA-DR loci) so there may be other forms of the nucleotide sequence under investigation 

15 and these other forms of the nucleotide sequence may be formed into a population of nucleic acid fragments which 
can participate in the duplex formation reaction of step (c) - for example see Bidwell, JL and Hui, KM (1990). 

The population of nucleic acid fragments bearing the nucleotide sequence under examination is formed from the 
first sample of nucleic acid by a method which preserves the relevant sequence information. It is presently preferred 
that the method used is one which directly amplifies the nucleotide sequence in the sample of DNA or RNA (which 

20 may for example be isolated directly from the organism under investigation or may be cDNA derived from mRNA by 
reverse transcription) to result in a population of nucleic acid fragments bearing the nucleotide sequence. Depending 
upon the method of amplification employed, the resulting nucleic acid fragments may be RNA or DNA. However, they 
are preferably DNA, formed from a DNA sample using an enzymatic amplification technique such as the polymerase 
chain reaction (PCR). The technique of "run-off" transcription (Green, MR, Maniatis, T, Melton, DA (1 983) Human beta- 

25 giobtn pre-mRNA synthesised in vitro is accurately spliced in Xenopus oocyte nuclei. Cell 32:681-694) may be used 
to prepare a population of RNA fragments from a DNA sample. 

The preferred method for forming from a DNA source a population of DNA fragments bearing the nucleotide se- 
quence is the polymerase chain reaction (PCR). In this procedure, two oligonucleotide primers are added to the DNA 
source for annealing to complementary sequences at either end of the nucleotide sequence, a heat-stable DNA 

30 polymerase, such as Taq polymerase, dATP, dCTP, dGTP and dTTP and appropriate biological buffers and cofactors. 
The DNA is denatured, the oligonucleotide primers anneal to their complementary sequences with the 3' ends pointing 
towards each other and the DNA polymerase results in extension of the annealed primers and amplification of the 
segment of DNA defined by the 5' ends of the primers. 

Where the synthetic construct is a DNA construct, the population of the synthetic DNA construct may be formed 

35 using the PCR, preferably employing the same primers as used for amplification of the nucleotide sequence under 
consideration although different primers could be used so long as the resulting amplified construct is capable of forming 
the necessary duplexes with the nucleic acid fragments. 

In the PCR, the cycle of DNA denaturation, primer annealing and synthesis of the DNA segment defined by the 5' 
ends of the primers is repeated as many times as is necessary to amplify the nucleotide sequence under consideration 

40 (or the construct) until sufficient is available for step (c) of the present method. Amplification may proceed for from 20 
to 40 cycles, for example from 25 to 35 cycles. Amplification of the nucleotide sequence under investigation and the 
construct may be conducted simultaneously in a mixed PCR or in separate PCRs. 

Any appropriate oligonucleotide primers may be employed. The primers should be suitable for amplification of 
specific locus under investigation. 

45 The primers may be labelled for facilitating the comparison between the results of the separation in step (iii) of the 

present method and the separation conducted on the duplexes formed in the second reaction on the second sample 
of DNA. The primers can be labelled with a directly detectable tag, for example a radionuclide such as 32 P or 35 S, a 
fluorescent compound such as fluorescein, an enzyme such as a horseradish peroxidase or alkaline phosphatase, 
with biotin or digoxigenin. The two primers may have the same or different labels. 

so Before formation of a population of the construct (for example by using the PCR), the construct must first be 

synthesised. Typically, the construct is synthesised from multiple synthetic oligonucleotides, for example. Once syn- 
thesised, it may be cloned in a suitable host, for instance an M13 phage. 

The amplified nucleic acid fragments and the amplified construct obtained from mixed PCR or separate PCRs are 
heated (mixed PCR) or mixed and heated (separate PCRs) to permit separation of the amplified product into single 

55 stranded nucleic acid which is then allowed to re-anneal under suitable annealing conditions to form the duplexes. 
Duplexes will be formed between the coding and non-coding strands present in the reaction vessel and, in particular, 
a proportion of the duplexes so formed will be mismatched hybrids between sense and anti-sense strands of partially 
homologous sequences ("heteroduplexes"), formed either in cis or trans or in both cis and trans. 
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The heteroduplexes formed are separated according to their molecular conformation which affects their apparent, 
but not actual, molecular weight. This may be achieved by, for example, electrophoresis. The separation is typically 
effected on a gel which does not fully denature the nucleic acid, such as a nondenaturing polyacrylamide gel. Electro- 
phoresis is conducted under conditions which effect a desired degree of resolution of the duplexes. A degree of res- 
olution that separates duplexes that differ in "apparent size" - resulting from their different molecular conformations - 
by as little as about 10 bp is usually sufficient. The heteroduplexes are "shifted" away from the homoduplexes in the 
separation by introducing one or more deliberate deletions in the construct as described above; this has the effect of 
retarding the mobilities of all heteroduplexes within the gel. Size markers may also be run on the gel to permit estimation 
of the apparent size of duplexes. 

The distribution, i.e f the resolution pattern, of the heteroduplexes will be allele-specific. This resolution pattern or 
PCR fingerprint can next be visualised. Where a PCR primer has been labelled, this label may be revealed. A substrate 
carrying the separated labelled duplexes is contacted with a reagent which detects the presence of the label. Where 
the PCR primers were not labelled, the substrate bearing the PCR fingerprint may be contacted with, for example, 
ethidium bromide and the nucleic acid fragments visualised under ultraviolet light; alternatively, the heteroduplexes 
may be visualised with silver staining. 

The relative molecular conformation of the duplexes is thus determined, in this embodiment, as a pattern of bands 
representing the duplexes which migrate different distances from the origin on the gel, dependent on their relative 
molecular conformation. This is compared with the relative molecular conformation of duplexes resulting from a similar 
exercise (using the same construct) conducted on a nucleotide sequence of a second sample of nucleic acid. In this 
way, correspondence between the nucleic acid samples may be obtained. It may be ascertained whether the relative 
molecular conformation of the duplexes obtained, and therefore whether the nucleotide sequences of the two samples 
are the same or not. 

The relative molecular conformation in respect of the second sample of the nucleic acid (typically DNA) may be 
obtained using the same conditions as are employed to obtain the relative molecular conformation in respect of the 
first nucleic acid sample. Typically the same primers are used PCR amplification need not necessarily be for the same 
number of cycles or under identical reaction conditions, though. Similarly, separation of the resulting duplexes need 
not be carried out in an identical fashion provided it is possible to assess the relative correspondence of the molecular 
conformation of the duplexes resulting from amplification of each sample. 

The second sample of nucleic acid may be analysed according to the present method simultaneously with or at a 
different time to analysis of the first nucleic cid sample. Indeed, a multiplicity of samples may be analysed. Typically 
the or each sample is a selected sample. Samples from selected individuals can be analysed. The relative molecular 
conformation determined for each sample may be held in a computer. 

A computer database may therefore be generated containing the relative molecular conformation distribution pat- 
terns for different samples. 

The PCR fingerprint using the method of the invention may be compared with another PCR fingerprint to determine 
whether the individuals, whose DNA has been tested to obtain the two fingerprints, match in respect of the particular 
nucleotide sequence under investigation. The present method can therefore be applied to DNA samples from two or 
more individuals. Alternatively, a PCR fingerprint may be compared with a standard or reference fingerprint previously 
obtained. Correspondence between fingerprints can therefore be determined. 

The present method can be used for example to determine whether a donor of a transplant or transfusion and a 
recipient or proposed recipient of the transplant or transfusion have matching allotypes. PCR fingerprints from the 
donor and the recipient or proposed recipient can be compared. The transplant may be a tissue transplant such as a 
heart, lung, liver or kidney transplant or a bone marrow transplant. The transfusion may be a blood transfusion. Matching 
of living related or unrelated donors for allogeneic transplantation may therefore be achieved. 

In particular, the method can be used in matching individuals in respect of one or more of the functional proteins 
expressed by the HLA (human leucocyte antigen) class I and II genes, for example the Class II DPB, DQB, DQA and 
DRB genes which all exhibit polymorphism. This may be used for "typing" individuals and "matching" a patient to a 
donor in a transplant operation. Where the DPB gene is under investigation, it has been found effective to analyse the 
second exon of the gene which is the known polymorphic sequence of the gene. A DNA construct for use in the method 
of the invention for analysis of the DPB1 second exon should preferably have a nucleotide sequence substantially 
corresponding to the invariable regions of the gene and nucleotide substitutions at the known variable nucleotides, for 
instance at one or more of positions 56, 58, 59, 60, 65, 66, 138, 141, 198, 199, 201, 205, 227, 239, 260, 262, 285, 
288, 290, 293 starting from the 3'- end of primer UG19 (a known primer used to prime DNA polymerisation of the 
second exon of DPB1 ). In addition, the construct contains at least one deletion when compared with the gene sequence 
under consideration, for instance a 5 base pair deletion at position 235 starting from the 3'- end of primer UG19 (SEQ 
ID NO: 6). This is in order more effectively to separate the heteroduplexes from the homoduplexes. 

Thus, whilst extension of the heteroduplex principle as described by Bidwell and Hui (1990), supra to H LA-DP 
matching is not directly possible because for DPB1 -specific PCR primers there is generally only a single PCR product, 
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nevertheless it is possible with the method of the present invention to generate PCR fingerprints for the DPB1 locus 
by permitting heteroduplex formation between the population of DNA fragments bearing the nucleotide sequence under 
examination with a synthetic molecular construct (such as that described above) which contains at least one deliberate 
nucleotide substitution, deletion or insertion, designed to be opposite and/or contiguous with a hypervariable nucleotide 

s within, for this example, polymorphic second exons of the target DPB1 genes. These constructs may be used following 
PCR reactions in order to produce heteroduplexes specific for individual DPB1 alleles, and thus to allow rapid matching. 
We have constructed and cloned a construct (see Figure 1) specific for the DPB1 locus which discriminates between 
the 21 specificities DPB1*0101 to DPB1*2001 (Fig 2). In practise, DPB1 second exon sequences are amplified from 
genomic DNA in the normal manner. In a separate PCR and using the same primers, the synthetic M DPB construct", 

10 or universal heteroduplex generator (UHG), is selectively amplified from an M1 3mp1 8 phage clone. Next, aliquots from 
the two separate reactions are mixed together and subjected to 2-3 further cycles of PCR, in order to allow heteroduplex 
formation. Finally, PCR fingerprints are examined in the normal way following polyacrylamide gel electrophoresis. Fig 
3 shows the results obtained for a selection of DPB1 HTCs. In DPB1 heterozygotes, some naturally occurring heter- 
oduplexes formed in trans might be expected, but in practice not all combinations of alleles produce heteroduplexes 

is without use of the method of the present invention employing a DPB construct (Fig 4a). However, conducting the 
method with an amplified DPB construct yields highly discriminatory PCR fingerprints of heteroduplexes (Fig 4b), anal- 
ogous to the DR-Dw system. Newly described alleles (for example, DPB1 *2101 -3601 ) may be catered for by alteration 
to one or more nucleotides within an existing construct. It is envisaged that this method will be particularly appropriate 
tothe DQA1 and DQB1 loci, and in conjunction with group-specific PCR-SSP, to problematic areas in DRB1 , for example 

20 DR4 subtype matching. 

The present method can also be used in determining whether an individual is susceptible to or has a disease 
associated with a particular nucleotide sequence which codes for a functional protein. Such diseases are reviewed in 
Immunol. Rev. 70. 1-218, 1983. 

In a method in accordance with the present invention for the diagnosis of a genetic disease, a suitable synthetic 

25 construct (usually a DNA construct) is employed which is capable of forming duplexes with a nucleotide sequence of 
(a) a translated or non-translated exon region, or (b) an intron region which affects expression or mRNA slicing, the 
nucleotide sequence containing a mutation responsible for a genetic disease, the construct containing one or more 
deliberate nucleotide substitution(s) and/or deletion(s) and/or insertion(s) which is/are (i) opposite to a known variable 
nucleotide or sequence of nucleotides within the sequence under examination and/or (ii) contiguous with a nucleotide 

30 which is opposite to a known variable nucleotide or sequence of nucleotides within the sequence under examination. 
DNA is then isolated from the individual under diagnosis, for example from whole blood obtained according to standard 
protocols or from dried blood spots or from any other suitable source of DNA. Populations of the chosen synthetic 
construct and the nucleotide sequence are formed, for example using the polymerase chain reaction and these pop- 
ulations are then combined under suitable conditions to permit duplex formation within the combined populations. The 

35 duplexes formed are then separated, for example using gel electrophoresis, and the duplexes visualised, for example 
by staining. The resulting profile is then compared with a standard profile representing the results obtained using the 
same synthetic construct either with a DNA sample known to contain the mutation under examination or a DNA sample 
known not to contain the mutation under examination. This enables the user of the method readily to identify whether 
the individual under diagnosis carries the mutation responsible for the genetic disease. 

40 The present invention is particularly suitable where the relevant gene is multiallelic but is not duplicated within the 

genome and therefore DNA heteroduplex formation following the polymerase chain reaction (PCR) is possible only in 
•trans' and therefore only in heterozygotes for a given gene. In accordance with the present invention, DNA heterodu- 
plexes can be generated by hybridising PCR products with synthetic DNA molecules. These are PCR-amplifiable DNA 
sequences that mimic a DNA sequence but differ from it as a result of controlled nucleotide substitution(s), deletion(s) 

45 or insertion(s) at sites continuous to known mutation sites within the DNA. The construct and DNA sequences are 
amplified with the same PCR primers, and hybridised together post-PCR by heating and slow cooling, resulting in DNA 
heteroduplexes having different conformational forms, and thus different electrophoretic mobilities, unique for individual 
alleles of a gene (Bidwell et al, 1992). 

The use of the present method is particularly suitable in, for example, phenylketonuria genotyping. Phenylketonuria 

50 (PKU) is probably the most common disorder of amino acid metabolism and results from mutations at the phenylalanine 
hydroxylase (PAH) gene. The gene consists of 13 exons and encodes a MRNA of 2.8 kilobases (Kwok et al, 1985; 
DiLella et al, 1 986). To date, more than 50 mutations have been described at the PAH gene but there are wide variations 
in the frequency distribution of individual mutations between different populations (Scriver et al, 1992). In the south 
west of England, three mutations in exon 1 2, namely I VS1 2nt 1 , R408W and Y41 4C, account for just over 40% of PKU 

55 chromosomes (Tyfield et al, 1991). In vitro expression analyses reveal 100% loss of PAH activity associated with 
I VS1 2nt1 and R408W mutations, and 50% reduction in activity for the Y414C mutation (Okano et al, 1 991 a, b). Within 
this population, other mutations found in the PAH gene are in exon 3 (I65T) and intron 10 (IVS10nt546), resulting in 
74% and 100% loss of PAH activity respectively (John et al, 1992; Dworniczak et al, 1991). 
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A significant proportion of early treated young adult PKU sufferers show clinical neurological deterioration (Thomp- 
son et al, 1991; Lou et al, 1992), and thus it is important to research the possible link between specific PAH gene 
mutations and long term prognosis. Because of the large number of disease-associated mutations at this locus it is 
desirable to have available a mutation detection system which is rapid, safe and reliable, and is able to screen samples 
for several mutations at one time. 

Several techniques of mutation detection have been described, including those that depend on conformational 
polymorphisms in amplified single stranded DNA (SSCP) (Orita et al,1989), chemical cleavage of mismatched DNA 
sequences in amplified DNA (Cotton et al, 1988), and those that depend on differences in electrophoretic melting 
properties between amplified heteroduplexes and homoduplexes in denaturing or thermal gradient gels (DGGE and 
TGGE) (Fisher and Lerman, 1983). However, none of these techniques is without some limitation regarding safety 
reliability or universal adaptability. 

In a method in accordance with the present invention for the diagnosis of a PKU, it has been found to be effective 
in many cases to analyse exon 12 of the PAH gene which contains at least five known mutation sites leading to full or 
partial inactivation of the PAH gene product. Thus the DNA construct should preferably have a nucleotide sequence 
substantially corresponding to an invariable region or regions of the nucleotide sequence of exon 12 of the gene, with 
at least one nucleotide substitution, deletion or insertion at or adjacent the known variable nucleotides. Thus, for in- 
stance, the DNA construct may have such a deletion, substitution or insertion at or adjacent to one or more of the 
following four positions, defined by reference to the specific codons of the PAH exon 12 wild type gene: 

the 1st nucleotide of codon 408; 
the 2nd nucleotide of codon 41 3; 
the 2nd nucleotide of codon 41 4; and 
the 1st nucleotide of codon 418; 

and may also have such a deletion, substitution or insertion at or adjacent the position corresponding to the 1st nucle- 
otide of the PAH intron 12 wiid type gene. Preferably, deletions and substitutions are employed. 
The invention also provides a test kit, which kit comprises: 

(a) two oligonucleotide primers suitable for use in PCR and capable of annealing to complementary sequences at 
respective ends of a nucleotide sequence to be examined; 

(b) a synthetic nucleotide (preferably DNA) construct which construct is capable of forming duplexes with the 
nucleotide sequence under consideration, the sequence of the construct being such that duplexes of different 
molecular conformation are formed between the construct and the nucleotide sequence under examination de- 
pendent upon the presence of a polymorphism at a known variable nucleotide or sequence of nucleotides within 
the sequence under examination; and 

(c) a control DNA and/or control PCR amplification product. 

The synthetic nucleotide construct contains one or more deliberate nucleotide substitution(s) and/or deletion(s) 
and/or insertion(s) which is/are (i) opposite to a known variable nucleotide or sequence of nucleotides within the se- 
quence under examination and/or (ii) contiguous with a nucleotide which is opposite to a known variable nucleotide or 
sequence of nucleotides within the sequence under examination; 

The primers may be labelled as above. The control PCR amplification product may also be labelled. The kit may 
further comprise one or more of the following: 

a heat-stable DNA polymerase; 

- DATP, DCTP, DGTP and DTTP; 
appropriate biological buffers and cof actors; 

- a database comprising the relative molecular conformation and gel mobilities of DNA fragments generated by PCR 
amplification and heteroduplex formation between selected DNA samples and the construct. 

For a better understanding of the present invention, and to show how the same may be carried into effect, reference 
will now be made to the accompanying drawings, in which: 

Figure 1 shows a "consensus" sequence for the second exon of the DPB-1 gene and a 5'- flanking (intron) region 
and the corresponding sequence of a synthetic DNA construct for use in a method in accordance with this invention. 
The sequence of the synthetic DNA construct is SEQ ID NO: 1 in the attached sequence listing. 
Figure 2 shows a PCR fingerprint for HLA-DPB1 alleles produced in accordance with the present invention using 
a construct having the sequence shown in Figure 1 ; 
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Figure 3 shows the PCR fingerprints for HLA-DPB1 homozygous typing cells using the method of the present 
invention and the construct having the sequence shown in Figure 1 ; 

Figure 4a shows the PCR fingerprints of HLA-DPB1 heterozygotes, without use of a construct; 
Figure 4b shows corresponding PCR fingerprints to those shown in Figure 4a, but using the construct of Figure 1 
5 in the PCR fingerprinting method; 

Figure 5 shows the nucleotide sequence of a portion of the normal human PAH gene, including exon 12 and part 
of the 5' and 3' flanking introns, as well as 5 genotypes including known PKU mutations and a DNA construct for 
use in identifying those mutations; 

Figures 6a, 6b, 7 and 8 show the stained gels for a series of experiments on the of PAH gene exon 12; and 
10 Figures 9(a), 9(b) and (c) model the homoduplexed and synthetic construct-heteroduplexed nucleotides between 

codons 407 and 410 of exon 12 of the PAH gene using in the model the construct sequence which identifies the 
R408W mutation. 

EXAMPLE 1 

15 

A DNA construct specific for the DPB1 locus was synthesised from multiple synthetic oligonucleotide "longmers", 
each of about 100 nucleotides in length, using techniques known in the art. In this example, four iongmers were em- 
ployed having the sequences identified SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 5. The DNA 
sequence of the construct was chosen to overlap the start of the second exon of the DPB1 gene, the nucleotide se- 

20 quence of which is known from Marsh, SGE and Bodmer, JG. (1991) "HLA Class II nucleotide sequences, 1991"; 
Tissue Antigens, 37: 181-189, and contained deliberate nucleotide substitutions and a deletion designed to be contig- 
uous with known hypervariable nucleotides within the sequence of the second exon of the DPB1 gene. The construct 
was 338 base pairs in length. The construct sequence included 29 mismatches and 5 unmatched bases between the 
construct and a "consensus" sequence for the second exon and 3* flanking intron of the DPB1 gene, the consensus 

25 sequence being determined by using a computer alignment program, MicroGenie (Version 7.01), Beckman Instru- 
ments. The construct was cloned in an M13mp18 phage. 

The DPB1 "consensus" sequence is the top line in Figure 1. This "consensus" sequence was determined using a 
computer alignment program and variable bases are identified by the letter N. This means that these particular bases 
in the DPB1 second exon vary between individuals and give rise to the various allelic forms of the gene. For simplicity, 

30 the 5'-3' coding strand of the consensus sequence only is shown. 

The bottom line of Figure 1 (also SEQ ID NO: 1) is a DNA construct designed to contain deliberate nucleotide 
substitutions and deletions contiguous with the known variable nucleotides within the nucleotide sequence (here the 
second exon of the DPB1 gene) under examination. Again, the 5'-3* coding stand is shown for simplicity. 

As will be noted, the construct is designed to have an identical sequence with the great majority of the non-variable 

35 nucleotides of the DPB1 consensus sequence, in order that duplexes will form between the DPB1 DNA fragments from 
the sample of genomic DNA under consideration and the construct. In addition, at the known variable nucleotides of 
the DPB1 sequence, deliberate nucleotide substitutions (sometimes hereinbelow referred to as "identifiers" as they 
"identify" any mutation at this position) are created. Each of these deliberate substitutions is marked by an asterisk. 
The substitutions are chosen such that, in different allelic forms of the gene, a different extent of mismatch will exist. 

40 in addition, adjacent the known variable nucleotides, it may be desirable to create a deliberate mismatching with a 
conserved nucleotide; these are underlined in the construct sequence and effectively amplify the adjacent mismatches. 
As a result, hereinbelow, these are occasionally referred to as "amplifiers". The DNA construct shown has a number 
of different mismatches at different variable nucleotides of the consensus sequence and, by careful choice of the 
mismatches, different numbers and extent of single stranded loops will be formed in the duplex between the nucleotide 

45 sequence under investigation and the construct, for different alleles. In addition, a deliberate deletion, for example the 
5 base pair deletion in the construct shown by the "A" in Figure 1 ensures that all duplexes formed between the genomic 
DNA derived from the sample and the construct will migrate at a significantly different rate to the duplexes formed 
within the population of genomic DNA. This "shifts" the bands derived from duplexes including the construct away from 
bands derived from duplexes not including the construct in the PCR fingerprint produced in accordance with the present 

so invention. 

The procedure to produce a PCR fingerprint for the DRB1 loci was as follows. 

The DBP nucleotide sequences under investigation (second exon) were amplified in the polymerase chain reaction 
from genomic DNA. The PCR primers employed were those described by Begovich et al (Begovich AB, Bugawan TL, 
Nepom BS, Klitz W, Nepom GT and Eriich HA (1989), "A specific HLA-DPp allele is associated with pauciarticular 
55 juvenile rheumatoid arthritis but not adult rheumatoid arthritis", Proc Natl Acad Sci USA 86:9489-9493) and which are 
identified as UG19 (SEQ ID NO: 6) and UG21 (SEQ ID NO: 7), available from suppliers of custom oligonucleotides. 
The reaction mixture contained: 
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Genomic DNA @ 0.1jig/u.l 


5.0uJ 


UG 19 primer (5u.M stock) 


10.0m:I 


UG21 primer (5u.M stock) 


10.0uJ 


10X PCR buffer (supplied) 


10.0uJ 


dNTP mix (10mM each DNTP) 


2.0uJ 


ddH 2 0 


62.5uJ 



The resulting mixtures were combined, vortexed and then centrifuged briefly. The genomic DNA was denatured 
by heating the tube at 94°C for 5 min andthen snap cooled on ice for 3 min and overlaid with 75uJ of paraffin oil. 

PCR was initiated using a "hot-start" procedure, whereby tubes containing the reaction mixture without Taq 
polymerase were equilibrated at 72°C in a heat block. Taq polymerase (2.5 units, 0.5uJ) was then added through the 
mineral oil, and the tube was vortexed again, recentrif uged briefly and samples immediately transferred to the thermal 
cycler, previously set at 72°C. The genomic DNA samples were then subjected to 35 PCR cycles, as follows. 

primer extension ! 72°C, 30 sec 
denaturation j 94°C, 30 sec 
primer annealing j 65°C, 60 sec 



The 72°C primer extension time at the end of the final cycle was increased to 10 min. 

The DNA construct was amplified from the M1 3mp18 clone using the same primers as for the genomic DNA under 
study, the reaction mixture being as follows: 



DPB-UHG M13mp18 DNA: 1/100 dilution from supplied stock 


1.0uJ 


UG1 9 primer (5uJv1 stock) 


10.0uJ 


UG21 primer (SuJvl stock) 


10.0uJ 


10X PCR buffer (supplied) 


10.0uJ 


dNTP mix (10mM each dNTP) 


2.0uJ 


ddH 2 0 


66.5uJ 



The same PCR and thermal cycling conditions were used as above for the genomic DNA, but with only 25 cycles. 

Following the PCR of the construct, 5uJ of PCR product was added to 1uJ of 6X gel loading buffer and electro- 
phoresed for 40 min at 70 volts in a 1 .5% (w/v) agarose minige! in 1 X TAE running buffer containing 0.5jjg/ml ethidium 
bromide (Sambrook J, Fritsch EF and Maniatis T (1989) Molecular cloning, A laboratory manual; Cold Spring Harbor 
Laboratory Press) (Sambrook et al 1 989, Supra). The resulting bands were the examined over a U V transilluminator 
and compared with the relative intensities of sample and DPB-construct PCRs and the concentrations adjusted to give 
the same intensity of fluorescence. Appropriate amounts (6-20uJ) of both PCR products were added to a single micro- 
centrifuge tube, and subjected to a further 3 PCR cycles as above. A 12uJ aliquot was analysed for heteroduplexes by 
polyacrylamide minigel electrophoresis as described for the Mini-Protean II gel electrophoresis system (Bio-Rad). In 
this technique, pour a 12% nondenaturing polyacrylamide (29:1) minigel, using 1X TBE running buffer and then add 
8-10uJ of PCR product to 2uJ 6X gel loading buffer containing bromophenol blue and xylene cyanole FF Electrophorese 
at 200 volts for 90-95 min. The exact time will need to be determined experimentally: the homoduplex band should be 
allowed to migrate to within 0.5cm of the end of the gel. Stain gels for 20 min in 1X TBE containing 0.5u.g/ml ethidium 
bromide and examine using a UV transilluminator. 

Following this procedure PCR fingerprints for HLA-DPB1 alleles were obtained (see Figure 2). 

For each HLA-DPB1 allele listed (DPB1*0101 to 2001), one of the two possible heteroduplexes formed with the 
DPB-construct is represented: single nucleotide mismatches are shown as boxes, matched regions as lines. For ex- 
ample in DPB1*01 01, mismatches in complementarity between the coding strand of the DPB1*0101 (upper) sequence 
and the "noncoding" strand of the construct (lower) sequence are shown, and in each case the individual mismatched 
nucleotide on the DPB1*0101 sequence is specified (see key). Nucleotides are numbered from the 5'-end of the DPB 
PCR primer UG19 (SEQ ID NO: 6). Black triangles represent deliberate deletions in the DPB-construct designed to 
increase gel retardation of the heteroduplexes and thereby to produce a clear spread of visible bands (see Figs. 3 and 
4b). The single DPB-construct employed theoretically generates an unique heteroduplex with each DPB1 allele. 

In addition, PCR fingerprints of HLA-DPB1 homozygous typing cells (HTCs) conducted in the presence of the 
DPB-construct were obtained (see Figure 3). In each case an unique PCR fingerprint is observed. Lane 11 shows an 
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left (PAH12L) , S'-TGT GGT TTT GGT CTT AGG AA-3* 
(SEQ ID NO: 11); and 
s right (PAH12R), 5'-AGT CTT CGA TTA CTG AGA AA-3' 

(SEQ ID NO: 12), 

Thermal cycling conditions were as for genomic DNA amplification (see below). The products of six replicate re- 

10 actions were pooled and electrophoresed on two 8% w/v nondenaturing polyacrylamide gels (Protogel, National Di- 
agnostics) as previously described (Bidwell and Hui, 1 990). Gel slices containing the double stranded construct band 
were excised and the DNA extracted by isotachophoresis (Ofverstedt et al, 1984), precipitated with ethanoi, washed 
in 70% ethanol, and redissolved in 50uJ sterile double distilled water. Ten microlitres of a 1 : 10,000 serial dilution was 
amplified for use in the heteroduplex analyses. 

is Genomic DNA was isolated from whole blood according to standard protocols, or alternatively from dried blood 

spots used in neonatal screening tests, as follows: a 2mm 2 area of blood spot was excised from the paper, added to 
a 0.5ml microcentrifuge tube containing 25uJ of a 1:10 dilution of AmpliTaq™ reaction buffer (Perkin Elmer Cetus). 
Samples were heated at 96°C for 15 min, cooled, diluted with a further 25u.l of 1X AmpliTaq™ reaction buffer, mixed 
and centrifuged at 13000g for 15 min. Supernatants were used directly in the PCR. Separate UHG and genomic DNA 

20 PCR amplifications were performed in 1 00u.l reaction mixtures containing 1 OuJ heat-denatured diluted UHG or genomic 
DNA isolated from whole blood (100ng) respectively, 10u1 each of 5u.M solutions of PAH12L and PAH12R primers 
(details above), 2u.l dNTP mix (dATP, dCTP, dGTP and dTTP (Boehringer Mannheim) each at a concentration of 1 0mM), 
58uJ deionized double distilled water, 10u.l AmpliTaq™ reaction buffer and 2 units AmpliTaq™ DNA polymerase (Perkin 
Elmer Cetus). The enzyme was added at 72°C to hot-start the reaction (D'Aquila et al, 1991). For PCR amplifications 

25 of DNA obtained from dried blood spots, 30uJ of supernatant was used in the reaction mixtures, and the volumes of 
AmpliTaq™ reaction buffer and water were reduced to 7uJ and 40.6p1 respectively The reaction mixtures were sub- 
jected to 35 rounds of thermal cycling at 94°C for 1 min, 48°C for 1 min and 72°C for 1 min. 

Fifteen microlitres of PCR-amplified genomic DNA and of PCR-amplified construct were mixed and subjected to 
3 rounds of thermal cycling at 94°C for 1 min, 65°C for 2.5 min and 72°C for 2.5 min (Note: 1 amplification of the 

30 construct was therefore sufficient for 6 reactions with PCR-amplified genomic DNA). The entire mixture was subjected 
to electrophoresis for 90 min at 200V on 12% w/v nondenaturing polyacrylamide minigels (Protogel, National Diagnos- 
tics) and heterodupiexes were visualised by staining with ethidium bromide as previously described (Bidwell and Hui, 
1990). 

Figure 6a, 6b, 7 and 8 show the stained gels for a series of experiments on the of PAH gene exon 1 2 mutations. 

35 in Figure 6(a), Lanes 1 and 12 show molecular weight markers (M, marker VIII, Boehringer Mannheim). The leading 

double bands in all other lanes are the homoduplexes of the synthetic DNA construct (leading band, 176bp) and PAH 
gene exon 12 (trailing band, 185bp) PCR products. Bands of apparent molecular weight 400-500bp are different con- 
formational forms of DNA heterodupiexes (Figs 6b and 8). Lanes 2, 11 , 1 3 and 16 show the heterodupiexes observed 
in normal individuals (controls). Other lanes show the variations in heterodupiexes observed in PAH gene exon 12 

40 mutations, denoted beneath each lane: horn, individual homozygous for mutation; net, individual heterozygous for an 
exon 1 2 mutation (non-exon 1 2 PAH gene mutation present on other chromosome). Lanes 6, 8 and 1 4 show individuals 
with different PAH gene exon 1 2 mutations on each chromosome. In all cases, characteristic heterodupiexes (also see 
Figure 6b) are observed for each mutation, and in addition the homozygosity or heterozygosity status is readily deter- 
mined. 

45 Figure 6(b) is a schematic representation of DNA heteroduplex banding patterns obtained for each PAH gene exon 

12 mutation. For each mutation, banding patterns are shown for homozygous (horn) and heterozygous (het) states. 
The mobilities of the bands (apparent molecular weights) are: a, 446bp; b, 433bp; c, 416bp; d, 395bp; e, 374bp; f, 
359bp; g, 354bp. Bands c plus g are observed in normal PAH genes and in genes with non-exon 12 mutations. 
The results shown in Figure 8 were obtained in the same way as Figures 7a and 7b and show the classification 
50 of mutations in PKU individuals using DNA isolated from dried blood spots. C, normal control; M, molecular weight 
markers (for details see Fig 2 legend); lane 1 , individual with R408W and I VS1 2nt1 mutations on opposite alleles; lanes 
2, 3, 5 and 6, IVS12nt1 heterozygous individuals; lane 4, IVS12nt1 homozygous individual. 

Figures 6a and 6b and 8 show that the synthetic construct was able to generate unique and discriminatory DNA 
heterodupiexes for all five known exon 12 mutations within DNA from PKU individuals, both in the homozygous and 
55 heterozygous states. Moreover, the visual quality of these results compared favourably with those obtained with DNA 
isolated from fresh blood samples, and permitted unambiguous assignment of exon 12 genotypes. 

The reliability and reproducibility of the test was examined by screening two sets of genomic DNA samples collected 
from PKU individuals in whom exon 12 mutations were established by other methods, either prospectively or retro- 
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spectivety. 

Figures 7a and 7b show the results of a blind trial of concordance between the DNA analysis method of the present 
invention and the results of other DNA typing analyses. Figure 7a represents the results using the first set of genomic 
DNA whilst Figure 7b represents the results using the second set of genomic DNA constituting untreated, PKU indi- 
viduals with reduced intelligence from a Scottish population. In both Figures 7a and 7b, lane M are molecular weight 
markers (for details see Figure 6 legend) and lane C is a normal control. In Figure 7a, the lanes were as follows: lane 
1, R408W heterozygous individual; lanes 2, 8, 9 and 11, IVS12nt1 heterozygous individuals; plane 3, individual with 
Y414C and IVS12nt1 mutations on opposite alleles; lane 4, Y414C heterozygous individual; lane 5, IVS12nt1 homozy- 
gote; lane 7, individual with R408W and IVS12nt1 mutations on opposite alleles. Lanes 6, 10 and 1 2 are PKU individuals 
with mutations outside exon 12. In Figure 7b, the lanes are as follows: lanes 1-7 and 9, IVS12nt1 heterozygous indi- 
viduals; lane 8, PKU individual with mutation outside exon 12. For the first set of genomic DNA (Figure 7a) there was 
complete concordance between results of the PAH-UHG tests and prospectively performed analyses using allele spe- 
cific oligonucleotide (ASO) probes or DNA sequencing. 

In the second set of genomic DNA, constituting untreated, PKU individuals with reduced intelligence from a Scottish 
population, 8 out of 9 samples were identified as heterozygous for the IVS12nt1 mutation (Figure 3b); these results 
were confirmed by ASO analysis (data not shown), and are not unexpected for this population (Sullivan et al, 1989). 

A practical consideration of the potential application of this technology is in the ability to rapidly genotype infants 
who are positively diagnosed in the neonatal screen for PKU. We addressed this by analysing genomic DNA prepared 
from a 2mm 2 area of dried blood spots normally used in the neonatal screening programme. These samples had been 
obtained from affected individuals who regularly send dried blood spots to the screening laboratory for monitoring blood 
phenylalanine concentrations. The samples were between 1 to 4 months old and had been left at ambient temperatures 
during that time. Samples were analysed from PKU individuals whose genotype had been established previously using 
ASO probes or DNA sequencing. In all cases, the visual quality of these results (Figure 8) compared favourably with 
those obtained with DNA isolated from fresh blood samples, and permitted unambiguous assignment of exon 12 gen- 
otypes. 

This Example shows the technique of the present invention to be a highly sensitive, rapid and non-radioactive 
screening and classification system for PKU in which five mutations in exon 12 of the PAH gene can be identified using 
a single DNA construct and one PCR-based test. The tests are technically simple to perform and suitable for the routine 
screening of large numbers of individuals. Results were obtained within 24 hours of receipt of blood samples, and were 
of equal visual quality when performed on whole blood (Figures 6 and 7) or dried blood spots (Figure 8). Evidence 
from this and parallel studies being conducted for other genetic diseases, suggests that previously undescribed mu- 
tations within sequences under study may be identified initially by an unmodified DNA construct. Following DNA se- 
quence analysis to identify the new mutation, specific modifications to the construct, derived from molecular modelling 
simulations, can optimise the identification of the new mutation in subsequent tests. The conformational and surface 
charge modifications induced in heteroduplexed DNA by constructs may be modelled. An example of the homoduplexed 
and synthetic construct-heteroduplexed nucleotides between codons 407 and 410 of exon 12 of the PAH gene using 
in the model the construct sequence which identifies the R408W mutation is shown in Figure 9. Conformational differ- 
ences include altered interatomic distances and angles, increased diameter of the helix (compare diameters in (b)), 
and changes in net surface charge. The changes in electrophoretic mobilities as a consequence of such modifications 
may be predicted to a large extent from such theoretical models prior to actual construction of specific synthetic con- 
structs. This Example illustrates the wide potential of synthetic DNA construct-based DNA heteroduplex technology. 
The technique appears ideally suited to the genotyping of multi^allelic genes where mutations are clustered within a 
reasonable distance and are flanked by conserved sequences suitable for PCR priming. Genotyping mutations within 
separate exons in genomic DNA may be facilitated using exon-specrfic constructs, or potentially by a single construct 
for genotyping cDNA derived from mRNA by reverse transcription PCR (Myers and Gelfand, 1991) 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: The University of Bristol 

(B) STREET: Senate House, Tyndall Road 

(C) CITY: Bristol 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): BS8 1TH 

(ii) TITLE OF INVENTION: NUCLEIC ACID ANALYSIS 
(Hi) NUMBER OF SEQUENCES: 12 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
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GCTGCAGGAG AGTGGCGCCT CCGCTCATGT CCGCCCCCTC CCCGCAGAGA 
ATTCAGTGTA 60 

CCACTTACGG CAGGAATGCT ACGCGTTTAA TGGGACACAG CGCTTCCTGG 
AGAGATACAT 120 

CTACAACCGG GAGGAGTTCG TGCGCTTCGA CAGCGACGTG GGGGAGTTCC 
GGGCGGTGAC 180 

GGAGCTGGGG CGGCCGCATG AGGACAACTG GAACAGCCAG AAGGCACTCC 
TGGAAGCGGG 240 

CAGTGCCGGA CAGGGAATGC AGACACAACT ACGAGCTGGA CGATGCCGTG 
ACCCTGCAGC 300 

GCCGAGGTGA GTGAGGGCTT TGGGCCGGAt CCG 
333 

(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GCTGCAGGAG AGTGGCGCCT CCGCTCATGT CCGCCCCCTC CCCGCAGAGA 
ATTCAGTGTA 60 

CCACTTACGG CAGGAATGCT ACGCGTTTAA 
90 

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CCGGAACTCC CCCACGTCGC TGTCGAAGCG CACGAACTCC TCCCGGTTGT 
AGATGTATCT 60 

CTCCAGGAAG CGCTGTGTCC CATTAAACGC GT 
92 
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(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GGAGTTCCGG GCGGTGACGG AGCTGGGGCG GCCGCATGAG GACAACTGGA 
ACAGCCAGAA 60 

GGCACTCCTG GAAGCGGGCA GTGCCGGACA G 
91 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CGGATCCGGC CCAAAGCCCT CACTCACCTC GGCGCTGCAG GGTCACGGCA 
TCGTCCAGCT 60 

CGTAGTTGTG TCTGCATTCC CTGTCCGGCA 
90 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GCTGCAGGAG AGTGGCGCCT CCGCTCAT 
26 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGGATCCGGC CCAAAGCCCT CACTC 
25 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



TGTGGTTTTG GTCTTAGGAA CTTTGCTGCC ACAATACCTC TTACCTTCTC 
AGCCGGCTGC 60 

ATACACCCAA AGGATTGAGG TCTTGGACAA TACCCAGCAG CTTAAGATTT 
TGGCTGATTC 120 

CATTAACAGA CCTTAATTTA CACCTTACGA GGCCACTCGG TTTCTCAGTA 
ATCGAAGACT 180 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



TGTGGTTTTG GTCTTAGGAA CTTTGCTGCC ACAATACCTC ACCTTCTCAG 
CCGGCTACAT * 60 

ACACCCAAAG GATTGAGGTC TTGGACAATA CCCAG 
95 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



AGTCTTCGAT TACTGA6AAA CCGAGTGGCC TCGTAAGGTG TAAATTAAGC 
TGTTAATGGA 60 

ATCAGCCAAA ATCTTAAGCT GCTGGGTATT GTCCAAG 
97 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



TGTGGTTTTG GTCTTAGGAA 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



AGTCTTCGAT TACTGAGAAA 
20 



Claims 

1. A method for examining polymorphisms in a nucleotide sequence of a first sample of nucleic acid comprising the 
steps of: 

(a) forming a population of nucleic acid fragments bearing the said nucleotide sequence; 

(b) combining the population of nucleic acid fragments with a population of a synthetic nucleotide construct 
which construct is capable of forming duplexes with the nucleic acid fragments, the sequence of the construct 
being such that duplexes of different molecular conformation are formed between the construct and the nucleic 
acid fragments dependent upon the presence of a polymorphism at a known variable nucleotide or sequence 
of nucleotides within the sequence of the sample under examination; 

(c) permitting duplex formation within the combined populations; and 

(d) separating the duplexes formed in step (c); 

wherein the synthetic construct contains one or more deliberate nucleotide substitution(s) and/or deletion(s) and/ 
or insertion(s) which is/are (i) opposite to a known variable nucleotide or sequence of nucleotides within the se- 
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quence under examination and/or (ii) contiguous with a nucleotide which is opposite to a known variable nucleotide 
or sequence of nucleotides within the sequence under examination. 

2. A method according to claim 1 , wherein the nucleotide sequence of the synthetic construct at least contains de- 
s liberate nucleotide substitution(s) and/or deletion(s) and/or insertion(s) contiguous with the nucleotide which is 

opposite the known variable nucleotide or sequence of nucleotides within the sequence under examination. 

3. A method according to claim 1 or 2, wherein the nucleotide sequence of the synthetic construct contains deliberate 
nucleotide substitution(s) and/or deletion(s) and/or insertion(s) opposite the known variable nucleotide or sequence 

10 of nucleotides. 

4. A method according to claim 3, wherein the synthetic construct includes nucleotide substttution(s) and deletion(s) 
both opposite and contiguous with the mutation. 

is 5. A method according to any one of claims 1 to 4, wherein the nucleotide sequence of the synthetic construct contains 
between one and ten contiguous substitutions/deletions adjacent the position of the polymorphism in the nucleotide 
sequence under investigation 

6. A method according to any one of claims 1 to 5, wherein the synthetic construct includes deliberate nucleotide 
20 substitution(s) and/or deletion(s) and/or insertion(s) corresponding to at least two known polymorphisms in the 

nucleotide sequence 

7. A method according to any preceding claim, wherein the construct includes a deliberate deletion or a series of 
deletions, at a position away from polymorphic sites in the nucleotide sequence under investigation. 

25 

8. A method according to any preceding claim, wherein the nucleic acid under examination is DNA. 

9. A method according to any preceding claim, wherein the synthetic nucleotide construct is a synthetic DNA. 

30 10. A method according to any preceding claim, wherein the population of DNA fragments bearing the nucleotide 
sequence is formed from a DNA source by a polymerase chain reaction. 

11. A method according to any preceding claim in which the synthetic construct is a DNA construct, wherein the pop- 
ulation of synthetic DNA construct is formed by a polymerase chain reaction. 

35 

12. A method according to any preceding claim for matching individuals in respect of one or more of the functional 
proteins expressed by the HLA (human leucocyte antigen) class I and II genes, wherein the nucleotide sequence 
under examination is a sequence, or part of a sequence, of a HLA Class I or II gene containing a known polymorphic 
site or sites. 

40 

13. A method according to any one of claims 1 to 11 , for determining an individual's susceptibility to a genetic disease, 
wherein the nucleotide sequence under examination is a sequence, or part of a sequence, of a gene containing 
a known polymorphic site or sites associated with a genetic defect. 

45 14. a method according to claim 13, wherein the genetic disease is phenylketonuria genotyping and the gene con- 
taining a known polymorphic site or sites is the phenylalanine hydroxylase (PAH) gene. 

15. A test kit, which kit comprises: 

so (a) two oligonucleotide primers suitable for use in PCR and capable of annealing to complementary sequences 

at respective ends of a nucleotide sequence to be examined; 

(b) a synthetic nucleotide construct which construct is capable of forming duplexes with the nucleotide se- 
quence under consideration, the sequence of the construct being such that duplexes of different molecular 
conformation are formed between the construct and the nucleotide sequence under examination dependent 

55 upon the presence of a polymorphism at a known variable nucleotide or sequence of nucleotides within the 

sequence under examination; and 

(c) a control DNA and/or control PCR amplification product; 
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wherein the synthetic construct contains one or more deliberate nucleotide substitution(s) and/or deletion(s) and/ 
or msertion(s) which is/are (i) opposite to a known variable nucleotide or sequence of nucleotides within the se- 
quence under examination and/or (ii) contiguous with a nucleotide which is opposite to a known variable nucleotide 
or sequence of nucleotides within the sequence under examination. 

i. A test kit according to claim 15, wherein the synthetic nucleotide construct is a synthetic DNA. 



Patentanspruche 



1. 



7. 



Verfahren zur Untersuchung von Polymorphismen in einer Nucleotidsequenz einer ersten Nucleinsaureprobe urn- 
fassend die Schritte: 



(a) Bilden einer Population aus Nucleinsaurefragmenten, die die Nucleotidsequenz enthalten, 

(b) Vereinen der Population aus Nucleinsaurefragmenten mit einer Population eines synthetischen Nucleotid- 
konstrukts, das mit den Nucleinsaurefragmenten Duplexe bilden kann, wobei die Sequenz des Konstrukts so 
ist, dass Duplexe mit verschiedenen Molekulkonformationen gebildet werden konnen zwischen dem Konstrukt 
und den Nucleinsaurefragmenten, abhangig vom Vorliegen eines Polymorphismus an einem bekannten va- 
nablen Nucleotidoder einer bekannten variablen Nucleotidsequenz in der Sequenz der untersuchten Probe, 

(c) Zulassen der Duplexbildung innerhalb der vereinten Populationen, und 

(d) Trennen der in Schritt (c) gebildeten Duplexe, wobei das synthetische Konstrukt ein oder mehrere gezielte 
Nucleotid-Substitutionen, -Deletionen und/oder -Insertionen enthalt, die (i) gegenuber einem bekannten va- 
riablen Nucleoid Oder einer bekannten variablen Nucleotidsequenz sind in der untersuchten Sequenz und/ 
Oder (h) nebst einem Nucleotid sind, das einem bekannten variablen Nucleotid oder einer bekannten variablen 
Nucleotidsequenz gegenuber ist in der untersuchten Sequenz. 

2. Verfahren nach Anspruch 1 , wobei die Nucleotidsequenz des synthetischen Konstrukts mindestens ein oder meh- 
rere gezielte Nucleotid-Substitutionen, -Deletionen und/oder -Insertionen enthalt nebst dem Nucleotid das qeqen- 
uber dem bekannten variablen Nucleotid oder der bekannten variablen Nucleotidsequenz ist in der untersuchten 

Sequenz. 

3. Verfahren nach Anspruch 1 oder 2, wobei die Nucleotidsequenz des synthetischen Konstrukts ein oder mehrere 
gezielte Nucleotid-Substitutionen, -Deletionen und/oder -Insertionen enthalt, gegenuber dem bekannten variablen 
Nucleotid oder der bekannten variablen Nucleotidsequenz. 

4. Verfahren nach Anspruch 3, wobei das synthetische Konstrukt ein oder mehrere Nucleotid-Substitutionen und 
-Deletionen enthalt, sowohl gegenuber als auch nebst der Mutation. 

5. Verfahren nach einem der Anspruche 1 bis 4, wobei die Nucleotidsequenz des synthetischen Konstrukts zwischen 
1 und 10 angrenzende Substitutionen/Deletionen enthalt nebst der Position des Polymorphismus in der untersuch- 
ten Nucleotidsequenz. 

6. Verfahren nach einem der Anspruche 1 bis 5, wobei das synthetische Konstrukt ein oder mehrere gezielte Nu- 
cleot.d-Substrtut.onen, -Deletionen und/oder -Insertionen enthalt, die mindestens zwei bekannten Polymorphismen 
in der Nucleotidsequenz entsprechen. 

Verfahren nach einem der vorstehenden Anspruche, wobei das Konstrukt eine gezielte Deletion enthalt oder eine 
Reihe von Deletionen an einer Stelle, entfernt von den polymorphen Bereichen in der untersuchten Nucleotidse- 

quenz. 



8. Verfahren nach einem der vorstehenden Anspruche, wobei die untersuchte Nucleinsaure DNS 



ist. 



9. Verfahren nach einem der vorstehenden Anspruche, wobei das synthetische Nucleotidkonstrukt eine synthetische 
DNS ist. 
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1 0. Verfahren nach einem der vorstehenden Anspruche, wobei die Population aus DNS-Fragmenten, die die Nucleo- 
tidsequenz enthalten, durch eine Polymerasekettenreaktion aus einer DNS-Quelle gebildet wird. 

11. Verfahren nach einem der vorstehenden Anspruche, bei dem das synthetische Konstrukt ein DNS-Konstrukt ist, 
s wobei die Population aus synthetischem DNS-Konstrukt durch eine Polymerasekettenreaktion gebildet wird. 

12. Verfahren nach einem der vorstehenden Anspruche zum Zuordnen von Individuen bezuglich ein oder mehrerer 
funktionaler Proteine, die von HLA (Human-Leucocyten-Antigen)-Genen der Klasse I und II exprimiert werden, 
wobei die untersuchte Nucleotidsequenz eine Sequenz oder ein Sequenzteil eines HLA-Gens der Klasse I oder 

10 || ist, das den oder die bekannten polymorphen Bereiche enthalt 

13. Verfahren nach einem der Anspruche 1 bis 11 , zur Feststeilung, ob eine Person eine Veranlagung fur eine gene- 
tische Krankheit hat, wobei die untersuchte Nucleotidsequenz eine Sequenz oder ein Sequenzteil eines Gens ist, 
das ein oder mehrere bekannte poiymorphe Bereiche enthalt, die mit einem Gendefekt verbunden sind. 

15 

14. Verfahren nach Anspruch 1 3, wobei die genetische Krankheit der Phenylketonurie-Genotyp ist und das Gen, das 
ein oder mehrere bekannte poiymorphe Bereiche enthalt, das Phenylalanin-Hydroxylase(PAH)-Gen ist. 

15. EinTestkit, umfassend 

20 

(a) zwei Oligonucleotid-Primer, geeignet zur Verwendung in der PCR und zum Tempern an komplementare 
Sequenzen an den jeweiligen Enden einer untersuchten Nucleotidsequenz, 

(b) ein synthetisches Nucleotidkonstrukt, das Duplexe mit der untersuchten Nucleotidsequenz bilden kann, 
25 wobei die Sequenz des Konstrukts so ist, dass Duplexe verschiedener Molekuikonformationen gebildet wer- 

den konnen zwischen dem Konstrukt und der untersuchten Nucleotidsequenz, abhangig vom Vorliegen eines 
Polymorphismus an einem bekannten variablen Nucleotid oder einer bekannten variablen Nucleotidsequenz 
in der untersuchten Sequenz, und 

30 (c) ein Kontroll-DNS- und/oder Kontroll-PCR-Amplif ikationsprodukt, wobei das synthetische Konstrukt ein oder 

mehrere gezielte Nucleotid-Substitutionen, -Deletionen und/oder -Insertionen enthalt, die (i) einem bekannten 
variablen Nucleotid oder einer bekannten variablen Nucleotidsequenz gegenuber sind in der untersuchten 
Sequenz und/oder (ii) nebst einem Nucleotid sind, das einem bekannten variablen Nucleotid oder einer be- 
kannten variablen Nucleotidsequenz gegenuber ist in der untersuchten Sequenz. 

35 

16. Ein Test-Kit nach Anspruch 15, wobei das synthetische Nucleotidkonstrukt eine synthetische DNS ist. 



Revendications 

40 

1. Procede d'examen des polymorph ismes dans une sequence nucleotidique d'un premier 6chantillon d'acide nu- 
cleique, comprenant les etapes consistant a : 

(a) former une population de fragments d'acide nucleique portant ladite sequence nucl6otidique ; 

45 (b) combiner la population de fragments d'acide nucleique avec une population d'un produit d'assemblage 

nucleotidique de synthese, lequel produit d'assemblage est capable de former des duplex avec les fragments 
d'acide nucleique, la sequence du produit d'assemblage 6tant telle que des duplex de conformation motecu- 
laire difterente sont formes entre le produit d'assemblage et les fragments d'acide nucleique en fonction de la 
presence d'un polymorphisme sur un nucleotide ou une sequence de nucleotides variables connus au sein 

so de la sequence de l'6chantillon en cours d'examen, 

(c) permettre la formation de duplex au sein des populations combinees ; et 

s6parer les duplex formes a I'etape (c) ; 

etant entendu que le produit d'assemblage de synthese contient une ou plusieurs substitutions et/ou deletions 
55 et/ou insertions nucieotidiques deiib6rees qui sont (i) opposees a un nucleotide ou une sequence de nucleotides 

variables connus au sein de la sequence en cours d'examen et/ou (ii) contigues a un nucleotide qui est oppose a 
un nucleotide ou une sequence de nucleotides variables connus au sein de la sequence en cours d'examen. 
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2. Procede selon la revendication 1 , ou la sequence nucleotidique du produit d'assemblage de synthase contient au 
moms une ou des substitutions et/ou deletions et/ou insertions nucleotidiques delibSrees contigues au nucleotide 
qui est oppose au nucleotide ou a la sequence de nucleotides variables connus au sein de la sequence en cours 
d'examen. 

3. Precede selon la revendication 1 ou 2, ou la sequence nucleotidique du produit d'assemblage da synthese contient 
une ou des substitutions et/ou deletions et/ou insertions nucleotidiques deliberees opposees au nucleotide ou a 
la sequence de nucleotides variables connus. 

4. Procede selon la revendication 3, ou le produit d'assemblage de synthese comprend une ou des substitutions et 
une ou des delations nucleotidiques opposees et contigues a la mutation. 

5. Procede selon Tune quelconque des revendications 1 a 4, ou la sequence nucleotidique du produit d'assemblaqe 
de synthese contient entre 1 et 10 substitutions/deletions contigues, adjacentes a la position du polymorphisme 
dans la sequence nucleotidique en cours d'etude. 

6. Procede selon I'une quelconque des revendications 1 a 5, ou le produit d'assemblage de synthese comprend une 
ou des substitutions et/ou deletions et/ou insertions nucleotidiques deliberees correspondant a au moins deux 
polymorphismes connus dans la sequence nucleotidique. 



7. 



8. 



9. 



Procede selon I'une quelconque des revendications precedentes, ou le produit d'assemblage contient une deletion 
ou une serie de deletions deliberees a une position distante de sites polymorphes dans la sequence nucleotidique 
en cours d'etude. 

rADN^ Sel0n ' qUelCOnqUe d6S revendica,ions precedentes, ou Cacide nucleique en cours d'examen est 

Procede selon Tune quelconque des revendications precedentes, oi. le produit d'assemblage nucleotidique de 
synthese est un ADN de synthese. 

10. Procede selon I'une quelconque des revendications precedentes, ou la population de fragments d'ADN portant la 
sequence nucleotidique est formee a partir d'une source d'ADN par amplification en chatne par polymerase. 

11. Procede selon I'une quelconque des revendications precedentes, oi. le produit d'assemblage de synthese est un 
produit d'assemblage d'ADN ou la population de produit d'assemblage d'ADN de synthese est formee par ampli- 
fication en chatne par polymerase. 

12. Procede selon I'une quelconque des revendications precedentes, destine a apparier des individus en fonction 
d une ou plusieurs des proteines fonctionnelles exprimees par les genes de HLA (antigene leucocytaire humain) 
de classe I et II, oil la sequence nucleotidique en cours d'examen est une sequence ou une partie d'une sequence 
d un gene de HLA de classe I ou II contenant un ou des sites polymorphes connus. 

13. Procede selon I'une quelconque des revendications 1 a 11 , destine a determiner la sensibility d'un individu a une 
maladte genetque, ou la sequence nucleotidique en cours d'examen est une sequence ou une partie d'une se- 
quence d'un gene contenant un ou des sites polymorphes connus associes a une anomalie gen&ique. 

14. Procede selon la revendication 13, ou la maladie genetique est le genotypage de la phenylcetonurie et le gene 
contenant un ou des sites polymorphes connus est le gene de la phenylalanine hydroxylase (PAH). 

15. Kit de dosage, lequel kit comprend . 

(a) deux amorces oligonucleotidiques utilisables en PCR et capables de s'hybrider a des sequences comple- 
mentaires aux extremes respectives d'une sequence nucleotidique a examiner; 

(b) un produit d'assemblage nucleotidique de synthese, lequel produit d'assemblage est capable de former 
des duplex avec la sequence nucleotidique consideree, la sequence du produit d'assemblage etant telle que 
des duplex de conformation moleculaire differente sont formes entre le produit d'assemblage et la sequence 
nucleotidique en cours d'examen en fonction de la presence d'un polymorphisme sur un nucleotide ou une 
sequence de nucleotides variables connus au sein de la sequence en cours d'examen • et 
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(c) un ADN temoin et/ou un produit d'amplification par PCR temoin ; 
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etant entendu que le produ it d'assemblage de synthese contient une ou piusieurs substitutions et/ou deletions 
et/ou insertions nucleotidiques deliberees qui sont (i) opposees a un nucleotide ou une sequence de nucleotides 
variables connus au sein de la sequence en cours d'examen et/ou (ii) contigues a un nucleotide qui est oppos6 a 
un nucleotide ou une sequence de nucleotides variables connus au sein de la sequence en cours d'examen. 

16. Kit de dosage selon la revendication 15, ou le produit d'assemblage nucleotidique de synthese est un ADN de 
synthese. 
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FIG. 3 
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FIG. 7a 
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FIG. 7b 
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FIG. 9 
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